From a Computational Linguistic Atlas to Dialectal Lexical Resources

Authors

  • Simonetta MONTEMAGNI
  • Eugenio PICCHI
Abstract

Computers can help dialectologists make full use of the information they have acquired: the basic dimensions of dialectal research can be enlarged and its possible outcomes can become more sophisticated. In this paper, we show how a dialectal database, DBT-ALT, containing the data collected for the Atlante Lessicale Toscano (‘Lexical Atlas of Tuscany’), can be used as the starting point for producing dialectal dictionaries and other kinds of lexicographic resources, provided that adequate computational tools are available. First, the architecture and functioning of DBT-ALT are described in detail. Second, we show how DBT-ALT access functionalities can be exploited to extract subsets of data that could be converted into independent lexicographic resources through the operation of a Lexicographic Workstation.

Similar resources

Dialectal resources on-line: the ALT-Web experience

The paper presents an on-line dialectal resource, ALT-Web, which gives access to the linguistic data of the Atlante Lessicale Toscano, a specially designed linguistic atlas in which lexical data have both a diatopic and diastratic characterisation. The paper focuses on: the dialectal data representation model; the access modalities to the ALT dialectal corpus; ontology-based search.

Patterns of language variation and underlying linguistic features: a new dialectometric approach

For almost forty years quantitative methods have been applied to the analysis of dialect variation: these methods focused mostly on identifying the most important dialectal groups using an aggregate analysis of the linguistic data (Séguy 1973; Goebl 1984; Nerbonne et al. 1999). While viewing dialect differences at an aggregate level certainly gives a more comprehensive view than the analysis of...

Tharwa: A Large Scale Dialectal Arabic - Standard Arabic - English Lexicon

We introduce an electronic three-way lexicon, Tharwa, comprising Dialectal Arabic, Modern Standard Arabic and English correspondents. The paper focuses on Egyptian Arabic as the first pilot dialect for the resource, with plans to expand to other dialects of Arabic in later phases of the project. We describe Tharwa’s creation process and report on its current status. The lexical entries are augm...

Dialectal Atlas of the Arab World - between Intention and Reality

Arabic dialectology has a long history and has achieved significant progress in collecting, analyzing, and classifying linguistic data. The present paper analyses modern trends in the linguistic situation in the Arab world and identifies the topics essential to Arabic dialectology that require urgent attention. During the last century, several attempts have been undertaken to create ...

Developing and Using a Pilot Dialectal Arabic Treebank

In this paper, we describe the methodological procedures and issues that emerged from the development of a pilot Levantine Arabic Treebank (LATB) at the Linguistic Data Consortium (LDC) and its use at the Johns Hopkins University (JHU) Center for Language and Speech Processing workshop on Parsing Arabic Dialects (PAD). This pilot, consisting of morphological and syntactic annotation of approxim...

Publication date: 2000